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ABSTRACT 



Disclosed is an inventive method for providing visual chat. 
A character image is read into memory representing a 
character a user wishes to be for the duration of the visual 
chat. Continuous frames of video images arc then received, 
typically using video camera, which include image data of 
a person. The bead image of the person is then tracked by the 
system, and portions of the head image are extracted from 
the video images. These extracted portions are preferably 
features of the person in the video image. Finally, the 
extracted portions of the head image are blended into 
corresponding areas of the charaaer image, such that the 
features of the blended character image match the features of 
the person, and change as the features of the person change. 
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SYSTEM AND METHOD FOR VISUAL CHAT 

BACKGROUND OF THE INVENTION 

The present invention relates to video oommunication 
and, more particularly, to a system and method for video 
chat. 

The World Wide Web of the Internet is the most success- 
ful distributed application in the history of computing. In the 
Web environment, client machines effect transactions to 
Web servers using the Hypertext Transfer Protocol (HTTP), 
which is a known Application protocol providing users 
access to files (e.g., text, graphics, images, sound, video, 
etc.) using a standard page description language known as 
Hypertext Markup Language (ITTML). HTML provides 
basic document formatting and allows the developer to 
specify "links" to other servers and files. In the Internet 
paradigm, a network path to a server is identified by a 
so-called Uniform Resource Locator (URL) having a special 
syntax for defining a network connection. Use of an HTML- 
compatible browser (e.g., Netscape Navigator) at a client 
machine involves specification of a link via the URL. In 
response, the client makes a request to the server identified 
in the link and receives in remm a document formatted 
according to HTML. 

In this environment, a user has many options of bow to 
communicate with others. One option is the use of electronic 
mail (email). Using email, a user is able to send messages to 
others users of the Internet anywhere a connection to the 
Internet can be established. Email generally allows a user to 
send typed messages and attachments, such as image files, or 
soimd files. However, email does not allow constant inter- 
action between users of the Internet. Similar to sending a 
letter, a user of email must wait for the intended email 
recipient to receive the message, compose a response, send 
the response, and then wait for the response to reach them. 

In a step toward increased participant interaction, chat 
rooms were created. A typical chat room allows participants 
to type messages, which are received by other participants 
nearly instantaneously. Although the quickness of this 
response increases the amount of partidpanl interaction over 
email, a chat room does not approach the interaction avail- 
able when using common a device as simple as an ordinary 
telephone. 

A further step toward increased participant interaction is 
video chat. Prior Art FIG. 1 is an illustration showing a 
typical computer system 10 suitable for video chat. The 
system includes a computer processor 12, a monitor 14, a 
microphone 16, and a camera 18. A user 20 is typically 
situated in firont of the computer system 10. As the user 20 
speaks, the microphone 16 detects the user's voice and 
transmits the sound to the computer processor 12, typically 
in the form of a digital signal. Simultaneously, the camera 18 
transmits video images of the user 20 to the computer 
processor 12. 

The video and audio signals are then transmitted, typi- 
cally in compressed form, to a recipient's computer, where 
they are decompressed and viewed/heard. In this manner, 
two users of the Internet, or any other network, can com- 
municate with one another in a highly interactive 
environment, wherein each user can both see and hear the 
person they are conununicating with. However, there are 
problems with conventional video chat 

One problem is the extreme amount of bandwidth 
required to use video chat effectively. Streaming video, 
utilized by video chat, uses an extreme amount of band- 
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width. Increased bandwidth is* required for* clearer .and- • >< i-^^ - 
smoother video images. Moreover, since the requirement on 
bandwidth is so high, a typical home computer system 
generally cannot handle more than a two-way video chat 

s session. Each additional video chat participant greatly 
increases the amount of required bandwidth. Thus, having 
more than two video chzx participants using typical home 
computer systems generally drains system resources such 
that the systems are imable to perform desirably. 

10 Another problem with video chat is the loss of anonymity 
among participants. Using email, or non-video dial, partici- 
pants are able to remain more or less anonymous, since 
participants cannot actually see each other. \^deo chat, on 
the other hand, allows participants to visually identify one 

15 another. Jhis may cause problems, especially with children, 
when strangers are essentially allowed to "look" into the 
homes of other users. 

In view of the forgoing, what is needed are improved 
methods and systems for providing video chat at a reduced 

20 bandwidth, thus allowing multiple participants on a typical 
home computer system. Further, there is a need for methods 
and systems for allowing anonymity during video chat, thus 
allowing users to enjoy the highly interactive nature of video 
chat without being visually identified by strangers. 

" SUMMARY OF THE INVENTION 

The present invention addresses these needs by providing 
an improved visual diat method. In one embodiment, a 
character image is read into memory representing a charac- 

30 ter a user wishes to be for the duration of the visual chat. 
Continuous frames of video images are then received, typi- 
cally using a video camera, which include image data of a 
person. The head image of the person is then tracked by the 
system, and portions of the head image are extracted from 

35 the video images. These extracted portions are preferably 
features of the person in the video image. Finally, the 
extracted portions of the head image are blended into 
corresponding areas of the character image, such that the 
features of the blended character image match the features of 

4Q the person, and change as the features of the person change. 
In another embodiment, a computer system is disclosed 
for providing visual chat. The system includes a computer 
processor, memory for storing character images, a camera, 
and logic for performing visual chat operations. In use, the 

45 camera transmits video images of a person to the computer 
processor. Thereafter, the logic performs visual chat opera- 
tions including tracking a head portion of a person, obtaining 
a portion of the person image from the head portion, and 
blending the obtained portion of the head image into the 

50 character image. 

A computer program for providing visual chat is disclosed 
in yet another embodiment of the present invention. The 
computer program includes a character input code segment 
that reads a character image into memory representing a 

55 character a person wishes to be for the duration of the visual 
chat, and a video code segment that receives video images 
of the person. The computer program also includes a track- 
ing code segment that tracks a head portion of the person, 
after which a gathering code segment obtains portions of the 

60 head portion from the person's image. These extracted 
portions are preferably features of the person in the video 
image. Finally, the extracted portions of the head image are 
blended into corresponding areas of the character image by 
a blending code segment. Preferably, the blending is per- 

65 formed such that the feamres of the blended character image 
match the features of the person, and change as the features 
of the person change. 
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•Advantageously, the present invention allows useis to be 
other "characters" for the duration of the video chat session. 
Moreover, the character obtains the same featvu^es as the 
user, in a manner similar to the look achieved by having a 
makeup artist, using prosthetics and makeup, disguise the 5 
user as the character. The character actually "looks" like the 
user might "look" if the user actually was the character, i.e., 
the blended character has essentially the same features as the 
user, and blinks and smiles as the user does. Moreover, the 
present invention allows visual chat at a reduced bandwidth, lo 
thus allowing many individuals to participate in the same 
video chat session. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Prior Art FIG. 1 is an illustration showing a typical prior is 
art computer system suitable for video chat; 

FIG. 2 is an illustration showing a system configuration 
for visual chat, in accordance with an embodiment of the 
present invention; 

FIG. 3 is an illustration showing an image of a user's face, 
in accordance with one embodiment of the present inven- 
tion; 

FIG. 4 is an illustration showing a character image, in 
accordance with another embodiment of the present inven- 
tion; 

FIG. 5 is an illxistration showing an image of a user's face 
including feature locations, in accordance with an embodi- 
ment of the present invention; 

FIG. 6 is an illustration showing sampled feature 30 
locations, in accordance with an embodiment of the present 
invention; 

FIG. 7 is an illustration showing a blended character 
image, in accordance with another embodiment of the 
present invention; 35 

FIG. 8 is an illustration showing a sampled feature 
location, in accordance with an embodiment of the present 
invention; 

FIG. 9 is a graph lowing the red color channel values for 
individual person pixels and their relation to the red color ^ 
channel of the average flesh tone, in accordance with an 
embodiment of the present invention; 

FIG. 10 is an illustration showing a modification of a 
character feature location, in accordance with another 
embodiment of the present invention; 

FIG. 11 is a flowchart showing a method for providing 
visual chat, in accordance with one embodiment of the 
present invention; 

FIG. 12 is a flowchart showing a method for blending the 
user's features into the character image, in accordance with 
one embodiment of the present invention; 

FIG. 13 is an illustration showing an image of a user with 
a head portion being tracked, in accordance with another 
embodiment of the present invention; 

FIG. 14 is an illustration showing the image of the user 
including a line of symmetry, in accordance with an embodi- 
ment of the present invention; 

FIG. 15 is an iUustration showing a graph of symmetry 
test locations, in accordance with one aspect of the present 50 
invention; 

FIG. 16 is an illustration showing the image of the user 
including feature sample lines, and feature fines, in accor- 
dance with an embodiment of the present invention; 

FIG. 17 is an illustration showing results along the feature ss 
sample fines, in accordance with an embodiment of the 
present invention; 
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■ FIG. 18 is an iUustration showing the image of the user- s 
including the interocular spacing, in accordance with an 
embodiment of the present invention; 

FIG. 19 is a graph showing the interocular pacing of a 
user, in accordance with an aspect of the present invention; 

FIG. 20 is a flowchart showing a method for tracking the 
user's face, in accordance with one aspect of the present 
invention; 

no. 21 is a flowchart showing a method for determining 
feature placement, in accordance with another embodiment 
of the present invention; and 

FIG. 22 is a flowchart showing a method for determiiung 
interocular spacing, in accordance with one aspect of the 
present invention. 

DETAIL DESCRIPTION OF THE INVENTION 

FIG- 1 was described in terms of the prior art, A preferred 
embodiment of the present invention wiU now be described 
with reference to FIG. 2. In the following description, 
numerous specific details are set forth in order to provide a 
thorough understanding of the present invention. It wiU be 
obvious, however, to one skilled in the art, that the present 
invention may be practiced without some or aU of these 
specific details. In other instances, well known process steps 
have not been described in detail in order not to unneces- 
sarily obscure the present invention. 

FIG. 2 is an iUxistration showing a system configuration 
30 for visual chat, in accordance with an embodiment of the 
present invention. The system 30 comprises an encoding 
system 32, a decoding system 34, and an optional network 
connection 36. The encoding system 32 includes a camera 
18, a face detector 38, a feature extractor 40, a color ratio 
analyzer 42, and an optional compressor 44. Also included 
in the encoding system 32 are Random Access Memory 
(RAM) 46, and Read Only Memory (ROM) 48. 

The decoding system 34 includes an optional decompres- 
sor 50, a raorph Blender 52, artwork memory 54, and 
monitor 14. The encoder system 32 and decoding system 34 
are connected through connection 60. 

In use, a user is typically located in front of camera 18, as 
shown in Prior Art FIG. 1. Referring back to FIG. 2, images 
of the user are received by camera 18 and transmitted to the 
face detector 38. The face detector 38 then tracks a head 
portion of the user located within the received video images. 
In addition, the face detector 38 tracks the face of the user, 
and determines feature locations, such as the eyes and 
mouth, of the user. In this manner, the locations of the users 
features are essentially ahvays known by the encoding 
system 32, even v/hcn the user's head and face move about 
within the video images. Accurate locations of the user's 
features are preferably maintained for the duration of the 
visual chat to enable acctirate blending of the user's features 
into a character image, described subsequently. 

After tracking the head of the user and determining the 
location of the user's features, the feature extractor 40 uses 
the feature location information to extract images of the 
user's feamres fi-om the video images. Preferably, the feature 
extractor 40 extracts images of the user's eyes and mouth 
fi^om the video images. In addition, the feature extractor 40 
preferably processes the extracted images to enhance their 
blendablility, as described in more detail subsequently. 

The color ratio analyzer 42 then analyzes the extracted 
features and generates color ratios for the extracted feature. 
The cobr ratios represent the relative brightness and/or 
darkness of pixels with the extracted featiu'e relative to an 
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• average flesh tone of rlhe extracted feature. Compressor 44 .'Ceming the sampled feature locations to the decoder systems 

then optionally compresses the feature color ratios. of other visual chat users. Thus, rather than transmitting the 

Generally, when visual chat is used with a computer entire video image to other users, only data concerning the 

network, the feature color ratios are preferably compressed sampled feature locations is transmitted to other users. This 
in order to reduce the bandwidth requirement of the system 5 residts in an extreme reduction in required bandwidth for the 

30. Much higher compression is attainable by compressing system, in many cases, as much as a 20:1 reduction in 

only the feature color ratios, rather the extracted images required bandwidth. 

themselves. Thus, rather than sending image pixel data to the piG. 7 is an illustration showing a blended character 

receiving computer, the present invention transmits only the image 80, in accordance with another embodiment of the 
feature color ratios. However, the present invention may be lO present invention. The blended character image includes the 

utilized on only one computer system, for example, to see a artwork of the original character image 72, and the sampled 

user's features on various character images. In this case, fcanirc locations 76, blended into the character image. As 

feature color ratio compression would not be necessary, as shown in HO. 7, the feamre locations 76 are blended into the 

will be apparent to those skilled m the art. character image, such that the features of the blended 
After compression, decompressor 50 receives the com- i5 character image 80 essentially match the user's features. By 

pressed feature color ratios and decompresses the feature blending the user's features into the character image 72, the 

color ratios. Typically, the compressed feature color ratios blended character image 80 essentially maintains textures 

are transmitted over network 36, however, as stated above located in the same are as the blended sampled features 76. 

the present invention may be utilized on one computer In should be borne in mind that the sampled features 76 are 
system. In this case, the decompressor 50 may be bypassed ^0 continuously updated as video information is sent to the 

if the feature color ratios were not compressed. It should be system by the system camera. Thus, the effect obtained is 

borne in mind that both the encoder system 32 and decoder similar to the effected obtained if one were looking at a 

system 34 interact with each other, and with the encoder and video of only the user's features, such as their eyes. Thus, 

decoder systems of other computer systems. During normal the features 76 change as the user blinks, talks, smiles, etc. 

visual chat, information transmitted by the encoder system Xo accomplish a smooth blending effect, the sampled fea- 

32 is transmitted to both decoder system 34 through con- tures are preferably processed, as described in detail subse- 

nection 60, and to a decoder system of another computer quently. 

system through network connection 36. 8 is an illustration showing a sampled feahire 
After decompression, the morph blender 52 receives the location 76, in accordance with an embodiment of the 
decompressed feature color ratio information. In addition, ^ present invention. The sample feature location 76 includes 
the morph blender 52 receives character image artwork from an inner area 82, a middle area 84, and an outer area 88. 
the artwork memory 54. The morph blender 52 then alters After sampling, the sample feature location 76 is processed 
pixels of character image utilizing the feature color ratio by a "feathering" technique, such that the inner area 82 
information. In this manner, the features of the user's image remains opaque and the outer area 88 becomes near trans- 
are blended into the character image. Using this technique, parent Feathering produces a smooth transition from the 
texnires of the character image are generally maintained opaqueness of the inner area 82 to the outer area 88, as will 
while still having features of the user's image blended into be apparent to those skilled in the art. 
the character image. The decompressed image is displayed After feathering, an average flesh tone is determined for 
on the monitor 14. In addition, the system may have a the sample feature location 76. The average flesh tone is 
microphone which for Uansmitting the users voice as audio preferably determined using a sampling zone 90. The sam- 
data. As will be apparent to those skilled in the art, the audio piing zone 90 inchides the middle area 84 and the outer area 
data may be transformed as appropriate, to suit the selected 88 of the sample feattire location 76. Preferably, extreme 
character. areas within the sample zone 90 are ignored when deter- 
FIG. 3 is an illustration showing an image 70 of a user's mining the average flesh tone. Extreme areas include areas 
face, in accordance with one embodiment of the present of extreme darkness such as the eyebrows. There arc various 
invention. As stated above, the present invention tracks the techniques to determine the average flesh tone of a sample 
user's face in order to determine the feature locations within feature location 76, as will be apparent to those skilled in the 
the image. The system also includes at least one character art 

image 72, as shown in HO. 4. The character image 72 Each sample feature location 76 includes a plurality of 

preferably mcludes blended out character feature locations pixels, termed "person pixels." Each person pixel includes 

74, such as the eyes and mouth. In this manner, the feamres three color channels, red, green, and blue, as will be apparent 

of the user's unage 70 can be blended into the character to those skilled in the art. For blending purposes, each color 

image 72 at the locations of the blended out character channel of each person pixel is compared to a corresponding 

features locations 74. color channel of the average flesh tone. 

FIG. 5 is an illustration showing an image 70 of a user's RG. 9 is a graph 100 showing red color channel values for 

face including feamre locations 76, in accordance with an individual person pixels 102 and their relation to the red 

embodiment of the present invention. Once the user's face color channel of the average flesh tone 104, in accordance 

has been tracked, the location of the user's features 76 is with an embodiment of the present invention. The graph 100 
known by the system. The system then samples these feature ^ includes a saturation value 106 and a null vahie 108. The 

locations in a radius centered on each feaUirc location 76. saturation value 106 is the vahie of the color channel at 

Preferably, the feaUirc locations 76 include the eyes and ioo%, and the nuU vale 108 is the value of the channel at 

mouth of the user's image. q% ^ feamre color ratio is determined for each channel 

FIG. 6 is an illustration showing the sampled feature value of each person pbcel. The determination of a feature 
locations 76, in accordance with an embodiment of the 65 color ratio depends on whether the color channel value for 

present invention. Once the sampled feature locations 76 are a person pixel is greater than or less than the corresponding 

obtained, the system transmits processed information con- color channel value for the average flesh tone. 
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. For color channel values of person pixels 102 greater than 
or equal to the corresponding color channel value of the 
average flesh tone 104, a person pixel delta 110 is compared 
to a lightening flesh tone delta 112. The person pixel delta 
110 is the difference between the color channel value for the 
person pixel 102, and the color channel value for the average 
flesh tone 104. The lightening flesh tone delta 112 is the 
difference between the saturation value of the color channel 
106, and the color channel value for the average flesh tone 
104. Thus, the feature color ratio for lighter person pixels is: 

person pixel delta 110: lightening flesh tone delta 112, 

For color channel values of person pixels 102 less than the 
corresponding color channel value of the average flesh tone 
104, the person pixel delta HO is compared to a darkening 
flesh tone delta 114. The darkening flesh tone delta 114 is the 
color channel value of the average flesh tone 104. Thus, the 
feature color ratio for darker person pixels is: 

person pixel delta 110: darkening flesh tone delta 114. 

Note, the person pixel delta 110 are positive for color 
channel values of person pixels greater than the color 
channel value for the average flesh tone, and negative for 
color channel values of person pixels less than the color 
channel value for the average flesh tone. Thus, the corre- 
sponding feature color ratios are positive for color chaimel 
values greater than the color channel value for the average 
flesh tone, and negative for color channel values of person 
pixek less than the color diannel value for the average flesh 
tone. In addition, feature color ratios are preferably deter- 
mined for each person pixel, for each of the three color 
channels. Thus, similar to the red color channel of FIG. 9, 
feature color ratios are determined for the blue color 
channel, and green color charmel. 

FIG. 10 is an illustration showing a modification of a 
character feature location 74, in accordance with another 
embodiment of the present invention. The character feature 
location 74 includes individual character pixels 120. Each 
character pixel 120 includes three color channels, red, green, 
and blue, as will be apparent to those skilled in the art. For 
blending purposes, each color channel of each character 
pixel is modified using the corresponding color channel 
feature color ratio of a corresponding person pixel. 

Also shown in FIG, 10 is a graph 122 showing of the red 
color channel. As in FIG. 9, the graph includes the saturation 
value 106 and the null value 108 for the red color channel. 
A feature color delta 124 is determined for each channel 
value 126 of each character pixel 120. The determination of 
a feature color delta 124 depends on whether the feature 
color ratio for the corresponding person pixel is positive or 
negative. 

For positive feature color ratios, a lightening character 
pixel delta 128 is multiplied by the feature color ratio. The 
lightening character pixel delta 128 is the difference between 
the satiu-ation value of the color channel 106, and the color 
channel value 126 for the character pixel. Thus, the feature 
color delta 124 is: 

(Ughtening character pixel delta 128)xfeature color ratio. 

For negative feature color ratios, a darkening character 
pixel delta 130 is multiplied by the feature color ratio. The 
darkening character pixel delta 128 is the color channel 
value 126 for the character pixel. Thus, the feature color 
delta 124 is: 

(darkening character pixel delta 130)xfeature color ratio. 

Note, the feature color delta 124 is positive for positive 
feature color ratios, and negative for negative feature color 
ratios. In addition, feature color deltas are preferably deter- 
mined for each character pixel, for each of the three color 
channels. Thus, similar to the red color channel of FIG. 10, 
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feature color deltas* are determined fortt the- blue^ color 
channel, and green color channel. 

The feature color delta is then added to the color charmel 
value 126 of the charaaer pixel. It should be borne in mind 

5 that for negative feature color deltas the color diannel value 
is reduced. This is repeated for each of the three color 
channel values for each character pixel 120. 

FIG. U is a flowchart showing a method 200 for provid- 
ing visual chat, in accordance with one embodiment of the 

10 present invention. In an initial operation 202, pre-process 
operations are performed. E>re-process operations include 
initializing the video camera connection, and establishing a 
visual chat connection. 

In a character reading operation 204, a character image is 

15 read into memory. A user preferably selects the character 
image from a plurality of character images. A character 
image preferably represents some type of face. However, the 
character image may represent any image that the user 
wishes to represent them in the visual chat session. 

20 Next in a video receiving operation 206, video images of 
the user are received by the system. Typically, the video 
images are obtained using a camera mounted near the user's 
computer. Generally, the video images include the face of 
the user, and preferably the user's face is the dominant focus 

25 of the video images. 

A head portion of the video images is then tracked in a 
head traddng operation 208. Preferably, the face of the user 
is tracked in addition to the head of the user. By tracking the 
face of the user, the system is able to determine the location 

30 of the user's features within the video imagps. Careful 
tracking of the user's features is an important component in 
creating a smooth blending of the user's features into the 
character image. 

After tracking the features of the user, features of the user 

35 arc obtained from the video images in a feature obtaining 
operation 210. Preferably, an area surrounding each of the 
user's eyes and the user's mouth is obtained from the video 
images. This area may then be "feathered" to enhance the 
blendabihty of the feature location, as described above. By 

40 reducing the video images to only the feature locations, the 
present invention reduces the required bandwidth of the 
visual chat 20:1 over conventional visual chat applications. 
Moreover, inforaaation conceming the user's features is 
typically transmitted to another visual chat user, and then 

45 blended into a character image. 

Next, in a blending operation 212, the obtained features 
are blended into the character image. In this manner, the 
features of the user are blended into the features of the 
character image, resulting in a character image that mimics 

50 the user Moreover, textures on the character image are 
retained while still allowing the user's features to be blended 
into the character image. 

Finally, in operation 214, the character image is used to 
represent the user in the visual chat session. Because of the 

55 reduced bandwidth required by the present invention, mul- 
tiple users may be involved in the same visual chat session 
essentially without causing unreasonably system resource 
problems. Moreover, the use of a character image to repre- 
sent the user allows the user to retain most of anonymity 

60 available in text based chat sessions. 

Referring next to FIG. 12, a method 212 for blending the 
user's features into the character image is shown, in accor- 
dance with one embodiment of the present invention. In an 
initial operation 300, an image of a feature is obtained from 

65 the user's video image. Preferably, this is accomplished by 
tracking the user's face to determine the location of the eyes 
and mouth of the user. 
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. Id an averaging operation 302; the average flesh. tone .£or' • - 
the feature is deteimiDed. Preferably the feature image is 
feathered around the edges to increase blendability. After 
feathering, the average flesh tone for the feature is deter- 
mined. Preferably, extreme areas within the feature image 5 
are ignored when determining the average flesh tone. 
Extreme areas include areas of extreme darkness, such as the 
eyebrows. There are various techniques to determine the 
average flesh tone of a feature image, as will be apparent to 
those skilled in the art. 10 

Next, in color ratio operation 306, color ratios for arc 
determined for the feature image. Each feattire image 
includes a pluraUty of pixels, termed "person pixels." Each 
person pixel includes three color channels, red, green, and 
blue, as will be apparent to those skilled in the art. For 15 
blending purposes, each color channel of each person pixel 
is compared to a corresponding color channel of the average 
flesh tone. Moreover, the determination of a feature color 
ratio depends on whether the color channel value for a 
person pixel is greater than or less than the corresponding 20 
color channel value for the average flesh tone. 

For color channel values of person pixels greater than or 
equal to the corresponding color channel value of the 
average flesh tone, a person pixel delta is compared to a 
lightening flesh tone delta. The person pixel delta is the 25 
difference between the color channel value for the person 
pixel, and the color chaimel value for the average flesh tone. 
The lightening flesh tone delta is the difference between the 
saturation value of the color channel, and the color chaimel 
value for the average flesh tone. Thus, the feature color ratio 30 
for lighter person pixels is: 

person pixel delta: lightening flesh tone delta. 

For color channel values of person pixels less than the 
corresponding color channel value of the average flesh tone, 
the person pixel delta is compared to a darkening flesh tone 35 
delta. The darkening flesh tone deha is the color chaimel 
value of the average flesh tone. Thus, the feature color ratio 
for darker person pixels is: 

person pixel delta: darkening flesh tone delta. 

Note, the person pixel delta are positive for color channel 40 
values of person pixels greater than the color channel value 
for the average flesh tone, and negative for color channel 
values of person pixels less than the color channel value for 
the average flesh tone. Thus, the corresponding feature cobr 
ratios are positive for color channel values greater than the 45 
color channel value for the average flesh tone, and negative 
for color channel values of person pixels less than the color 
channel value for the average flesh tone. In addition, feature 
color ratios are preferably determined for each person pixel, 
for each of the three color channels. Thus, feature color 50 
ratios are determined the red color channel, the blue color 
channel, and green color chaimeL 

In a character blending operation 308, the color ratios are 
used to blend the user's features into the character image at 
the character feature locations. Each character feature loca- 55 
tion includes individual character pixels. Each character 
pixel includes three color channels, red, green, and blue, as 
will be apparent to those skiUed in the art. For blending 
purposes, each color channel of each character pixel is 
modified using the corresponding color channel featm^ 60 
color ratio of a corresponding person pixel. 

A feature color delta is determined for each channel value 
of each character pixel. The determination of a feature color 
delta depends on whether the feature color ratio for the 
corresponding person pixel is positive or negative. 6S 

For positive feature color ratios, a tightening character 
pixel delta is multiplied by the feature color ratio. The 
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•lightening character pixel deltais the difference between the-, 
saturation value of the color channel, and the color channel 
value for the character pixel. Thus, the feature color delta is: 

(lightening character pixel delta)xfeattire color ratio. 

For negative feature color ratios, a darkening character 
pixel delta is multiplied by the feature color ratio. The 
darkening character pixel delta is the color channel value of 
the character pixel. Thus, the feature color delta is: 

(darkening character pixel delta)xfeature color ratio. 

Note, the feature color delta is positive for positive feature 
color ratios, and negative for negative feature color ratios. In 
addition, feature color deltas are preferably determined for 
each character pixel, for each of the three color channels. 
Thus, feature color deltas are determined for the red cobr 
channel, the blue color channel, and the green color channel. 

The feature color delta is then added to the color channel 
value of the character pixel. It should be borne in mind that 
for negative feature color deltas the color channel value is 
reduced. This is repeated for each of the three color channel 
values for each character pixel. 

Finally, in operation 310, the blended character image is 
used to represent the user in the visual chat session. Because 
of the use of color ratios for altering the character image, the 
character image essentially retains all textures while stfll 
allowing the user's features to be blended into the character 
image. 

To determine the location of a user's features, the present 
invention tracks the user's head and face. RG. 13 is an 
niustration showing an image 400 of a user with a head 
portion 402 t>eing tracked, in accordance with another 
embodiment of the present invention. The image 400 
includes a head portion 402 of the user, and a bead tracking 
box 404. In use, the present invention tracks the head portion 
402 of the user and surrounds the head portion 402 with the 
head tracking box 404. Details of head tracking are dis- 
cussed in detail in U.S. patent application Ser. No. 09/364, 
859, filed Jul. 30, 1999 on behalf of Jeffrey Layne Edwards, 
entitled System, Method And Article Of Manufacture For 
Tracking A Head Of A Camera-Generated Image Of A 
Person, which is incorporated herein by reference in its 
entirety. 

FIG. 14 is an illustration showing the image 400 of the 
user including a line of symmetry 406, in accordance with 
an embodiment of the present invention. The image 400 
includes the head portion 402 of the user, the bead tracking 
box 404, and a line of symmetry 406. The line of symmetry 
406 is a line representing the center of the user's face, 

FIG. 15 is an illustration showing a graph 410 of sym- 
metry test locations 412. In determining the line of symme- 
try 406, the image 400 of the user is divided into two head 
sections 414a and 414^. The head sections 414a and 4146 
are preferably processed using a Gaussian filter and a 
Laplacian filter. Next, the processed bead sections 414a and 
4146 are mathematically overlaid to determine the best line 
of symmetry 406. When charted on graph 410, the best line 
of symmetry 406 is located at the highest peak 416 on the 
graph 410. The same approach is utilized to determine the 
line symmetry 406 when the user's image is rotated, as in 
400'. Thus, the line of synmietry 406 is determined by the 
present invention regardless of the head tilt of the user in the 
video images. 

FIG. 16 is an illustration stowing the image 400 of the 
user including feature sample lines 420, and feature lines 
422, in accordance with an embodiment of the present 
invention. Each feature sample line is preferably parallel to 
the line of symmetry 406, and is about as long as the trackiiig 
box. Each feature line is preferably perpendicular to the line 
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of symmetry 406;^begins .with' ao- initial' operation -502,-;- 
wherein pre-process operations are performed. Pre-process 
operations inchide obtaining video images of a user, and 
initializing the visual chat. 

In a head tracking operation 504, the head portion of a j 
user's image is tracked, and the location of the head rect- 
angle is determined. Details of bead tracking are discussed 
in detafl in U.S. patent application Ser No. 09/364,859. filed 
Jul. 30, 1999 on behalf of Jeffrey Layne Edwards, entitled 
System, Method And Article Of Manufacnire For Tracking lo 
A Head Of A Camera-Generated Image Of A Person, which 
is incorporated herein by reference in its entirety. 

In a blurring operation 506, a blurring filter is applied to 
the user's image within the head rectangle. Preferably, the 
blurring filter is a wide aperture Gaussian filter. The wide 15 
aperture Gaussian filter blurs the head portion of the person 
image. 

Next in a sharpening operation 508, a sharpening filter is 
applied to the user's image within the head rectangle. 
Preferably, the sharpening filter is a Laplacian filter. The 20 
Laplacian filter sharpens the head portion of the person 
image. 

A symmetry analyzer is then applied to the processed 
head image, in a symmetry operation 510. The symmetry 
analyzer determines a line of symmetry that defines the 25 
center of the user's face. In determining the line of 
symmetry, the image of the user is divided into two head 
sections. Next, the two head sections are mathematically 
overlaid to determine the best location of symmetry. When 
charted on graph, the best location of synametry is at the 30 
highest peak on the graph. The same approach is utilized to 
determine the line symmetry when the user's image is 
rotated, as in. Thus, the line of symmetry is determined by 
the present invention regardless of the head lilt of the user 
in the video images. Further details on determining the line 35 
of symmetry can be found in "Practical Real-Time Imaging 
Stereo Matcher", by H. K. Nishihara, Optical Engineering 
(Sept/Oct. 1984), incorporated herein by reference in its 
entirety. 

In a feature placement operation 512, the locations of the 40 
user's features within the video image are determined. After 
determining the feamre locations, the system is able to 
accurately sample the feature locations to obtain images of 
the user's 

FIG. 17 is an illustration showing results 423 along the 45 
feature sample lines 420, in accordance with an embodiment 
of the present invention. The results 421 include a Gaussian 
graph 424, a Laplacian graph 426, and a product graph 428. 
The Gausian graph 424 includes Gaussian results 430 firom 
applying a Gaussian filter to the user's image along a feature so 
sample line 420. The Laplacian graph 426 includes Lapla- 
cian results 432 from applying a Laplacian filter to the user's 
image along a feature sample line 420. Finally, the product 
graph 428 includes product results 434 from multiplying 
together the Gaussian results 430 and the Laplacian results 55 
432. 

After tacking the user's head portion, locations of the 
feature sample lines 420 are determined. The Gaussian result 
430 and the Laplacian resuh 432 along each feature sample 
420 line are multiplied together to obtain the product results 60 
434. The dark feature areas of the user's eyes and mouth 
create feature peaks 436 in the product results 434. The 
location of the feature lines 422 are then determined to be 
the locations of the feature peaks 436 in the product graph 
428. 65 

FIG. 18 is an illustration showing the image 400 of the 
user including the intcrocular spacing 440, in accordance 



with an -embodiment .of the present- iDvention:--The - feature > 
fine 422 corresponding to the user's eye location is termed 
the "ocular feature line." The interocular spacing 440 is the 
distance between the center of the eyes along the ocular 
feature line. This distance is made symmetric about the line 
of symmetry and used to determine the location of the user's 
eyes. 

FIG. 19 is a graph 450 showing the interocular spacing 
440 of a user, in accordance with an aspect of the present 
invention. To determine the interocular spacing 440, a 
Gaussian curve 452 is sampled along the ocular feature line. 
An average value 454 is then computed for the Gaussian 
curve 452. The user's eyes create ocular wells 456 below the 
average value 454 of the Gaussian curve 452. A weighted 
average 458 is then determined for each ocular well 456. Hie 
linear distance between the weighted averages is the ocular 
spacing 440. This distance is then made symmetric about the 
line of symmetry 406 to determine the location of the center 
of the user's eyes in the video images. 

Referring next to FIG. 20, a method 500 for tracking the 
user's face is shown, in accordance with one aspect of the 
present invention. The method 500 features, such as the 
user's eyes and mouth. These feature images can then be 
blended in a character image for the visual chat session. 

Finally, in a fine tune operation 514, the tracking results 
are transmitted back into the system to help in continued 
face tracking efforts. In this maimer, the accuracy of the face 
tracking is improved over time. 

FIG. 21 is a flowchart showing a method 512 for deter- 
mining feature placement, in accordance with another 
embodiment of the present invention. The method 512 
begins with an initial operation 600 in which various pre- 
process operations are performed. Pre-process operations 
include determining the bcation of the head and determin- 
ing the line of symmetry for the face. 

In a sample line operation 602, the location of feature 
sample lines is determined. Preferably, the feature sample 
lines are located parallel to, and made symmetric around, the 
line of symmetry. The feature sample lines are used to 
determine the location of the feature lines. 

Next, in a product operation 604 a Gaussian result and a 
Laplacian result are multiplied along the feature sample 
lines. After the locations of feature sample lines are 
determined, a Gaussian result and a Laplacian result along 
each feature sample hue are multiplied together to obtain 
product results along cadi feature sample line. 

In a feature placement operation 606, feature lines are 
located along the feature sample lines. The dark feature 
areas of the user's eyes and mouth create feature peaks in the 
product results along each feature sample line. These feature 
peaks are therefore used to determine the location of the 
feamre lines. Thus, the feature lines are located at the 
locations of the feature peaks in the product graph for each 
feamre sample line. 

The interocular spacing is then determined, in an 
interocular spacing operation 608. The interocular spacing is 
the distance between the center of the user's eyes in the 
video images. After determining the interocular spacing, the 
user's eyes can then be accurately sampled from the user's 
video image. 

Finally, in operation 610, the system utilizes the features 
lines and interocular spacing to sample the user's features 
from the video images and blend them into the character 
image. 

Turning next to FIG. 22, a method 608 for determining 
interocular spacing is shown, in accordance with one aspect 
of the present invention. The interocular spacing is the 
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distance between the center of the> eyes- along the ocular 
feature line. This distance is made symmetric about the line 
of symmetry and used to determine the location of the user's 
eyes. In an initial operation 700, various pre-process opera- 
tions are performed. Pre-process operations include deter- 5 
mining the location of a line of symmetry for the user's face 
and determining the location of feature lines. 

In a Gaussian curve operation 702, a Gaussian curve is 
determined along an ocular feature line. The feature line 
corresponding to the user's eye location is termed the lO 
"ocular feature line." A Gaussian curve is sampled along this 
ocular feature line. The Gaussian curve may be generated 
anew or sampled from the previous Gaussian filter applied 
to the head rectangle^ as described above. 

An average value is then computed for the Gaussian 15 
curve, in an averaging operation 704. The average vahie is 
used to determine the location of the eyes along the ocular 
feature line, as described subsequently. 

Next, in a weighting operation 706, the center of mass of 
each ocular well is determined. The user's eyes create ocular 20 
wells below the average value of the Gaussian curve, while 
the user's nose creates a peak along the Gaussian curve. The 
ocular wells are used to determine the location of the user's 
eyes in the video image, by way of weighted averages. A 
weighted average is determined for each ocular well located 25 
below the average value of the Gaussian curve. The linear 
distance between the determined weighted averages is the 
ocular spacing. This distance is then made symmetric about 
the line of symmetry to determine the location of the center 
of the user's eyes in the video images. 30 

Finally, in operation 708, the interocular spacing is then 
used by the system to sample the user's eyes in the video 
images. It should be borne in mind that the ocular spacing 
preferably always made symmetric around the line of sym- 
metry. In this manner, movement of the user's eyes docs not 35 
aSect the tracked eye location. Since both eyes move in the 
same manner, the distance between the darkest points of the 
eyes, i.e., the pupils, remains essentially constant. 

While the present invention has been described in terms 
of several preferred embodiments, there are many 40 
alterations, permutations, and equivalents, which fall within 
the scope of this invention. It should also be noted that there 
are many alternative ways of implementing the methods and 
apparatuses of the present invention. It is therefore intended 
that the following appended claims be interpreted as includ- 45 
ing all such alteration, permutations, and equivalents as fall 
within the tme spirit and scope of the present invention. 

What is claimed is: 

1. A method for providing a visual chat session compris- 
ing: 50 
reading a character image into memory; 
receiving video images including image data of a person, 

wherein the video images include a plurality of video 

frames; 

tracking a portion of a person image in the video images; 
executing an image gathering operation for obtaining a 

portion of the person image; and 
executing an influencing operation for influencing the 

character image utilizing the obtained portion of the go 

person image; 

wherein the portion of the person image is obtained from 
a head portion of the person image, and wherein the 
influencing operation includes a blending operation 
comprising the operation of: 65 
blending the obtained portion of the person image into 
the character image to create a blended character 



.r ;image,-whereia the blerxled character image includes, 
features corresponding to features of the person 
image, and wherein the features of the blended 
character image change as the features of the person 
change; 

wherein the blending operation comprises the operations 
of: 

determining an average flesh tone value for each fea- 
ture; 

determining a color relationship between each person 
pixel value in a feature and the determined average 
flesh tone value of the feature; and 

generating a new color value for a corresponding 
character pixel of the character image, wherein the 
new color value is generated utilizing the determined 
color relationship. 

2. A method as recited in claim 1, wherein the color 
relationship is a ratio. 

3. A method as recited in claim 1, wherein the color 
relationship is a plurality of feature color ratios each corre- 
sponding to a color channel of a person pixel in the person 
image. 

4. A method as recited in claim 3, wherein a feature color 
ratio for a color channel is a ratio of: 

a difference between a value of the color chatmel for the 
person pixel and a value of a corresponding color 
charmel for the average flesh tone value, to 

a difierenoe between a value of the color channel at 100% 
and a value of the corresponding color channel for the 
average flesh tone value, 

wherein the color channel for the person pixel is greater 
than or equal to the value of a corresponding color 
channel for the average flesh tone value. 

5. A method as recited in claim 4, wherein the new color 
value is generated by adjusting values of color channels of 
the corresponding character pixel, wherein each a value of 
a color channel is lightened by an amount of: 

a difference between the value of the color channel at 
100% and a value of the corresponding color channel of 
the character pixel, 

the difference being multiplied by the corresponding 
feature ratio. 

6. A method as recited in claim 3, wherein a feature color 
ratio for a color channel is a ratio of: 

a difference between a value of the color channel for the 
person pixel and a value of a corresponding color 
charmel for the average flesh tone value, to 

a value for the corresponding color channel for the 
average flesh tone value, 

wherein the color channel for the person pixel is less than 
the value of a corresponding color channel for the 
average flesh tone value. 

7. A method as recited in claim 6, wherein the new color 
value is generated by adjusting values of color channels of 
the corresponding charaaer pixel, wherein each a value of 
a color channel is darkened by an amount of: 

a value of a corresponding color channel of the character 
pixel multiphed by the corresponding feature ratio. 

8. A method as recited in claim 3, further comprising the 
operations of: 

compressing the feature color ratios; and 
transmitting the feature color ratios over a network. 

9. A method for providing a visual chat session compris- 
ing: 

reading a character image into memory; 
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- rcoeivmg video -images including'-'iinage data of a person,, 
wherein the video images include a plurality of video 
&ames; 

tracking a portion of a person image in the video images; 
executing an image gathering operation for obtaining a ^ 

portion of the person image; and 
executing an influencing operation for influencing the 

character image utilizing the obtained portion of the 

person image; 

wherein the process of tracking the portion of the person 
image includes the operations of: 
determining a basic location of the head portion of the 

person image; 
processing the head portion of the person image; 
determining a hne of symmetry for the head portion; 

and 

determining the location of features within the head 
portion; 

wherein processing the head portion includes the opera- 
tions of: 

applying a blurring operation to the head portion, 
wherein the blurring operation is utilized to blur 
the head portion of the person image; and 

applying a sharpening operation to the head portion, 
wherein the sharpening operation is utilized to ^ 
sharpen the head portion of the person image. 

10. A method as recited in claim 9, wherein the blurring 
operation comprises applying a Gaussian Alter to the bead 
portion, and wherein the Gaussian filter is a wide aperture 
filter. 

11. A method as recited in claim 9, wherein the sharpening 
operation comprises applying a Laplacian filter to the bead 
portion. 

12. A method for providing a visual chat session com- 
prising: 

reading a character image into memory; 

receiving video images including image data of a person, 

wherein the video images include a plurality of video 

frames; ^ 
tracking a portion of a person image in the video images; 
executing an image gathering operation for obtaining a 

portion of the person image; and 
executing an influencing operation for influencing the 

character image utilizing the obtained portion of the 45 

person image; 

wherein the process of tracking the portion of the person 
image includes the operations of: 
determining a basic location of tbe head portion of the 

person image; 50 
processing the head portion of the person image; 
determining a line of symmetry for the head portion; 

and 

determining the location of feattires within the head 
portion; 55 

wherein the operation of determining the location of 
features within the head portion includes the opera- 
tions of: 

determining a location for feature sample lines with 
the head portion of the person image; 60 

sampling a Gaussian result along the feature sample 
lines; 

sampling a Laplacian result along the feature 

samples lines; 
generating featiu'e sample summary resiilt by multi- 65 

plying the Gaussian result to tbe Laplacian result 

along tbe features samples lines; 



. -determining a location forfeature lines along feature- 
sample lines at locations corresponding to peaks in 
the feature sample summary result, wherein the 
feature lines include an ocular feature line; and 
determining interocular spacing along the ocular 
feature line. 

13. A method as recited in claim 12, wherein the operation 
of determining interocular spacing comprises the operations 
of: 

determining a Gaussian curve along the ocular featutre 
line; 

computing an average value of the Gaussian curve; 

determining a first weighted average of the Gaussian 
curve below the average value of the Gaussian curve, 
wherein the first weighted average represents a first 
center of mass of a first ocular well along the Gaussian 
curve; 

determining a second weighted average of the Gaussian 
curve below the average value of the Gaussian curve, 
wherein the second weighted average represents a 
second center of mass of a second ocular well along the 
Gaussian curve; and 

computing a linear difference between the first center of 
mass and the second center of mass. 

14. A computer system for providing a visual chat session, 
the computer system suitable for use in a distributed com- 
puting environment, the computer system comprising: 

a computer processor; 

memory for storing a character image; 

a camera for obtaining video images including image data 
of a person, wherein the video images include a plu- 
rality of video frames; and 

logic embodied on a computer readable medium, includ- 
ing computer executable instructions for: 
tracking a head portion of a person image in the video 
images; 

obtaining a portion of tbe person image, wherein the 
portion of the person image is obtained from the 
head portion of the person image; and 
blending the portion of the person image into the 
character image, wherein the blended character 
image includes features corre^onding to features of 
the person image, and wherein the features of the 
blended character image change as the features of the 
person change; 
wherein the computer executable instructions for blend- 
ing comprises: 

logic for determining an average flesh tone value for 
each feature; 

logic for determining a color relationship between each 
person pixel value in a feature and the determined 
average fiesh tone value of the feature; and 

logic for generating a new color value for a correspond- 
ing character pixel of the character image, wherein 
the new color value is generated utilizing tbe deter- 
mined color relationship. 

15. A computer system as recited in claim 14, wherein tbe 
color relationship is a ratio. 

16. A computer system as recited in claim 14, wherein the 
color relationship is a plurality of feature color ratios each 
corresponding to a color channel of a person pixel in the 
person image. 

17. A computer system as recited in claim 16, wherein a 
feature color ratio for a color channel is a ratio of; 

a difference between a value of the color channel for the 
person pixel and a value of a corresponding color 
diannel for the average flesh tone value, to 
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. - a difference between a value of the color channel at 100%. 
and a value of the corresponding color channel for the 
average flesh tone value, 
wherein the color chaimel for the person pixel is greater 
than or equal lo the value of a corresponding color 
channel for the average flesh tone value. 

18. A computer systena as recited in claim 17, wherein the 
new color value is generated by adjusting values of cobr 
channels of the corresponding character pixel, wherein each 
a value of a color channel is hghtcncd by an amount of: 

a difference between the value of the color channel at 
100% and a value of the corresponding color channel of 
the character pixel, 

the difference being multiplied by the corresponding 
feature ratio. 

19. A computer system as recited in claim 16, wherein a 
feature color ratio for a color channel is a ratio of: 

a difference between a value of the color channel for the 
person pixel and a value of a corresponding color 
channel for the average flesh tone value, to 

a value for the corresponding color channel for the 
average flesh tone value, 

wherein the color channel for the person pixel is less than 
the value of a corresponding color chaimel for the 
average flesh tone value. 

20. A computer system as recited in claim 19, wherein the 
new color value is generated by adjusting values of color 
channels of the corresponding character pixel, wherein each 
a value of a color channel is darkened by an amoimt of: 

a value of a corresponding color diannel of the character 
pixel multiplied by the corresponding feature ratio. 

21. A computer system as recited in claim 16, wherein the 
logic includes computer executable instructions for 

compressing the feature color ratios; and 

logic for generating a new color value for a corresponding 
character pixel of the character image, wherein the new 
color value is generated utilizing the determined color 
relationship. 

22. A computer system for providing a visual chat session, 
the computer system suitable for use in a distributed com- 
puting environment, the computer system comprising: 

a computer processor; 

memory for storing a character image; 

a camera for obtaining video images including image data 
of a person, wherein the video images include a plu- 
rah'ty of video frames; and 

logic embodied on a computer readable medium, includ- 
ing computer executable instructions for: 
tracking a bead portion of a person image in the video 
images; 

obtaining a portion of the person image, wherein the 
portion of the person image is obtained from the 
head portion of the person image; and 

blending the portion of the person image into the 
character image, wherein the blended character 
image includes features corresponding to features of 
the person image, and wherein the features of the 
blended character image change as the features of the 
person change; 
wherein the computer executable instructions for tracking 

the head portion comprises: 

logic for determining a basic location of the head 

portion of the person image; 
logic for processing the head portion of the person 

image; 
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: logic for determining a line of symmetry' for* the* head 
portion; and 

logic for determining the location of features within the 
head portion; 

wherein the logic for processing the head portion com- 
prises: 

logic for applying a blurring operation to the head 
portion, wherein the bltming operation is utilized to 
blur the head portion of the person image; and 

logic for applying a sharpening operation to the head 
portion, wherein the sharpening operation is utilized 
to sharpen the head portion of the person image. 

23. A computer system as recited in claim 22, wherein the 
blurring operation comprises applying a Gaussian filter to 
the head portion, and wherein the Gaussian filter is a wide 
aperture filter. 

24. A computer system as recited in claim 22, wherein the 
sharpening operation comprises applying a Lapladan filter 
to the head portion. 

25. Aoomputer system for providing a visual chat session, 
the computer system suitable for use in a distributed com- 
puting environment, the computer system comprising: 

a computer processor; 

memory for storing a character image; 

a camera for obtaining video images including image data 
of a person, wherein the video images include a plu- 
rality of video frames; and 

logic embodied on a computer readable medium, includ- 
ing computer executable instructions for: 
tracking a head portion of a person image in the video 
images; 

obtaining a portion of the person image, wherein the 
portion of the person image is obtained from the 
head portion of the person image; and 

blending the portion of the person image into the 
character image, wherein the blended character 
image includes features corresponding to features of 
the person image, and wherein the features of the 
blended character image change as the features of the 
person change; 

wherein the computer executable instructions for track- 
ing the head portion comprises: 
logic for determining a basic location of the head 

portion of the person image; 
logic for processing the head portion of the person 
image; 

logic for determining a line of symmetry for the head 
portion; and 

logic for determining the location of features within 

the bead portion; 
wherein the logic for locating the features within the 

bead portion comprises: 

logic for determining locations of feature sample 
lines with the head portion of the person 
image; 

logic for sampling a Gaussian result along the 

feature sample lines; 
logic for sampling a Laplacian result along the 

feature samples lines; 
logic for generating feature sample summary 

result by multiplying the Gaussian result to the 

Laplacian result along the feamres samples 

lines; 

logic for determining locations of feature lines 
along feature sample lines at locations corre- 
sponding to peaks in the feature sample sum- 
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tt r-. i- ,*-v mary result, wherein the feature lines include 
an ocular feature line; and 
logic for determining interocular spacing along 
the ocular feature line. 

26. A computer system as recited in claim 25, wherein the 
logic for determining interocular spacing comprises: 

computer executable insUuctions for determining a Gaus- 
sian curve along the ocular feamre line; 

computer executable instructions for computing an aver- 
age value of the Gaussian curve; 

computer executable instructions for determining a first 
weighted average of the Gaussian curve below the 
average value of the Gaussian curve, wherein the first 
weighted average represents a first center of mass of a 
first ocular well along the Gaussian curve; 

computer executable instructions for determining a sec- 
ond weighted average of the Gaussian curve below the 
average value of the Gaussian curve, wherein the 
second weighted average represents a second center of 20 
mass of a second ocular well along the Gaussian curve; 
and 

computer executable instructions for computing a linear 
difference between the first center of mass and the 
second center of mass. 

27. A computer program embodied on a computer read- 
able medium for providing a visual chat session, the com- 
puter program comprising: 

a character input code segment that reads a character 
image into memory; 

a video code segment that receives video images includ- 
ing image data of a person, wherein the video images 
include a plurality of video frames; 

a U-acking code segment that tracks a head portion of a 
person image in the video images; 

a gathering code segment that obtains a portion of the 
person image, wherein the portion of the person image 
is obtained fi-om the head portion of the person image; 
and 

a blending code segment that blends the portion of the 
person image into the character image, wherein the 
blended character image includes features correspond- 
ing to features of the person image, and wherein the 
features of the blended character image change as the 
feamres of the person change; 

wherein the blending code segment comprises: 
logic for determining an average flesh tone value for 
each feature; 

logic for determining a color relationship between each 
person pixel value in a feature and the determined 
average flesh tone value of the feature; and 

logic for generating a new color value for a correspond- 
ing character pixel of the character image, wherein 
the new color value is generated utilizing the deter- 
mined color relationship. 

28. A computer program as recited in claim 27, wherein 
the color relationship is a ratio. 

29. A computer program as recited in claim 27, wherein 
the color relationship is a plurality of feamre color ratios 
each corresponding to a color channel of a person pixel in 
the person image. 

30. A computer program as recited in claim 29, wherein 
a feature color ratio for a color channel is a ratio of: 

a difference between a value of the color channel for the 
person pixel and a value of a corresponding cobr 
channel for the average flesh tone value, to 
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: a difference between a value of the color channel at- 100% 
and a value of the corresponding color chatmel for the 
average flesh tone value, 
wherein the color channel for the person pixel is greater 
than or equal to the value of a correspondiog color 
channel for the average flesh tone value. 

31. A computer program as recited in claim 30, wherein 
the new color value is generated by adjusting values of color 
channels of the corresponding character pixel, wherein each 
a value of a color channel is lightened by an amount of: 

a difference between the value of the color channel at 
100% and a value of the corresponding color channel of 
the character pixel, 

the difference being multiplied by the corresponding 
feature ratio. 

32. A computer program as recited in claim 29, wherein 
a feature color ratio for a color channel is a ratio of: 

a difference between a value of the color channel for the 
person pixel and a value of a corresponding color 
channel for the average flesh tone value, to 

a value for the corresponding color channel for the 
average flesh tone value, 

wherein the color channel for the person pixel is less than 
the value of a conesponding color channel for the 
average flesh tone value. 

33. A computer program as recited in claim 32, wherein 
the new color value is generated by adjusting values of color 
channels of the corresponding character pixel, wherein each 
a value of a color channel is darkened by an amount of: 

a value of a corresponding color channel of the character 
pixel multiplied by the corresponding feature ratio. 

34. A computer program as recited in claim 29, further 
comprising: 

a code segment that compresses the feature color ratios; 
and 

a code segment that transmits the feamre color ratios over 
a network. 

35. A computer program embodied on a computer read- 
able medium for providing a visual chat session, the com- 
puter program comprising: 

a character input code segment that reads a character 
image into memory; 

a video code segment that receives video images includ- 
ing image data of a person, wherein the video images 
include a plurality of video frames; 

a tracking code segment that tracks a head portion of a 
person image in the video images; 

a gathering code segment that obtains a portion of the 
person image, wherein the portion of the person image 
is obtained from the head portion of the person image; 
and 

a blending code segment that blends the portion of the 
person image into the character image, wherein the 
blended character image includes feamres correspond- 
ing to features of the person image, and wherein the 
features of the blended character image change as the 
features of the person change; 
wherein the tracking code segment comprises: 

a head locating code segment for determining a basic 

location of the head portion of the person image; 
a processing code segment for processing the head 

portion of the person image; 
a symmetry code segment for determining a line of 

symmetry for the head portion; and 
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a ^feature locating code segpaent ^for "determining* the 
location of features within the head portion; 
wherein processing code segment comprises: 

logic for applying a blurring operation to the head 
portion, wherein the blurring operation is utilized to 5 
blur the bead portion of the peison image; and 

logic for applying a sharpening operation to the head 
portion, wherein the sharpening operation is utilized 
to sharpen the head portion of the person image. 

36. A computer program as recited in claim 35, wherein 10 
the blurring operation comprises applying a Gaussian filter 

to the head portion, and wherein the Gaussian filter is a wide 
aperture filter. 

37. A computer program as recited in claim 35, wherein 
the sharpening operation comprises applying a Laplacian i5 
filter to the head portion. 

38. A computer program embodied on a computer read- 
able medium for providing a visual chat session, the com- 
puter program comprising: 

a character input code segment that reads a character 
image into memory; 

a video code segment that receives video images includ- 
ing image data of a person, wherein the video images 
include a plurality of video frames; 

a tracking code segment that tracks a head portion of a 
person image in the video images; 

a gathering code segment that obtains a portion of the 
person image, wherein the portion of the person image 
is obtained from the head portion of the person image; 30 
and 

a blending code segment that blends the portion of the 
person image into the character image, wherein the 
blended character image includes features correspond- 
ing to features of the person image, and wherein the 
features of the blended character image change as the 
features of the person change; 

wherein the tracking code segment comprises: 

a head locating code segment for determining a basic 

location of the head portion of the person image; ^ 
a processing code segment for processing the head 
portion of the person image; 



- * .^~a-* symmetry code segment 'for ^determining: a line of 
symmetry for the head portion; and 
a feature locating code segment for determining the 

location of features within the head portion; 
wherein the feature locating code segment comprises: 
logic for determining a location for feature sample 
lines with the head portion of the person image; 
logic for sampling a Gaussian result along the feature 
sample lines; 

logic for sampling a Laplacian result along the 

feature samples lines; 
logic for generating feature sample summary result 
by multiplying the Gaussian result to the Lapla- 
cian result along the features samples lines; 
logic for determining a location for feature lines 
along feature sample lines at locations corre- 
sponding to peaks in the feature sample sunmiary 
result, wherein the feature lines include an ocular 
feattu^ line; and 
logic for determining interocular spacing along the 
ocular feature line. 
39. A computer program as recited in claim 38, wherein 
the logic for determining interocular spacing comprises: 
logic for determining a Gaussian curve along the ocular 
feature line; 

logic for computing an average vahie of the Gaussian 
curve; 

logic for determining a first weighted average of the 
Gaussian curve below the average value of the Gaus- 
sian curve, wherein the first weighted average repre- 
sents a first center of mass of a first ocular well along 
the Gaussian curve; 

logic for determining a second weighted average of the 
Gaussian curve below the average value of the Gaus- 
sian curve, wherein the second weighted average rep- 
resents a second center of mass of a second ocular well 
along the Gaussian curve; and 

logic for computing a linear difference between the first 
center of mass and the second center of mass. 
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